WO 99/55908 PCT/FR99/01000

Nucleotide sequences for the detection of
enterohaemorrhagic Escherichia coli (EHEC)

The subject of the invention is two nucleic
sequences of plasmid origin, present in bacteria of the
enterohaemorrhagic Escherichia coli (EHEC) group, and
the use of these sequences for the identification of
EHECs, especially those possessing the genes encoding
the virulence factors enterohaemolysin and intimin,
and more particularly for the specific detection of
serotype O157:H7. The invention also relates to a
method using these sequences and to the detection kits
containing them.

Bacteria of the EHEC group belong to the
verotoxin-producing Escherichia coli (VTEC) family,
responsible for diarrhoeal syndromes whose consequences
may be fatal in humans. In particular, EHECs can cause
haemorrhagic colitis (HC) and may lead to major
complications such as haemolytic uraemic syndrome (HUS)
or thrombotic thrombocytopenic purpura
(Griffin and Tauxe, Epidemiol. Rev. 13, 1991, 60-98).

Accordingly, the impact of these infections on
public health is such that it calls for increased
monitoring of foodstuffs and for rapid means of
detection, in particular in the case of epidemics.

Several serotypes belonging to the EHEC group
have been identified and held responsible for various
epidemic foci: O157:H7, O26:H11, O111:NM, O103:H2,
O145:NM, etc. (Acheson and Keusch, ASM News 62, 1996,
302-306). However, serotype O157:H7 is the one which
has been most frequently isolated.

The traditional methods of detection consist in
identifying the bacteria or in detecting the toxins
secreted by them. The detection of E. coli O157:H7 is
mainly carried out by serotyping, combined with tests
for metabolic properties, namely the absence of
sorbitol fermentation and/or the absence of
β-glucuronidase activity. Moreover, no bacteriological
method exists which is specific for the detection of
EHECs; there are only tests which help to orient the
diagnosis. In particular, the use of agars supplemented
with blood or washed red blood cells makes it possible
to demonstrate the enterohaemolytic character generally
present in EHECs.

In general, the bacteriological and
immunological methods for the detection of
E. coli O157:H7 are long, tedious and relatively
expensive, and require serological confirmation.
Moreover, these methods do not make it possible to
establish an identification of E. coli O157:H7
because of cross-reactions with other bacterial genera
and species, which make the interpretation difficult.

The use of nucleic probes has therefore
emerged as an alternative to these traditional
methods. Great efforts have been made to develop
probes capable of detecting, in a sensitive and
specific manner, the EHEC-type E. coli bacteria
involved in HC and/or HUS cases, of which the most
widespread prototype is E. coli O157:H7.

In particular, probes or fragments allowing
the detection of the genes responsible for the
virulence of E. coli, also called virulence factors,
have been published. However, none of the currently
known virulence factors makes it possible, on its own,
to identify pathogenic strains of E. coli O157:H7 or
of EHECs.

Thus, the use of nucleic probes or fragments
for the detection of genes encoding verotoxins (vt1 or
st1, vt2 or st2), described by many research groups
(Karch and Meyer, J. Clin. Microbiol. 27, 1989,
2751-2757; Gannon et al., Appl. Env. Microbiol. 58,
1992, 3809-3815; Begum et al., J. Clin. Microbiol. 31,
1993, 3153-3156; Witham et al., Appl. Env. Microbiol.
62, 1996, 1347-1353), has shown that the genes encoding
verotoxins are associated with the pathogenic bacterial
strains E. coli O157:H7 and other EHECs, but may also
be present in nonpathogenic E. coli strains, or
possibly in other bacterial species such as Shigella
dysenteriae, Citrobacter freundii, and the like.

Likewise, the adhesion protein called intimin
is also involved in the virulence of EHEC-type
bacteria. Probes have in particular been selected on
the corresponding gene (eae) by Gannon et al. in J.
Clin. Microbiol. 31, 1993, 1268-1274, Louie et al. in
Epidemiol. Infect. 112, 1994, 449-461 and Meng et al.
in Int. J. Food Microbiol. 32, 1996, 103-113. However,
although this virulence factor is closely associated
with the EHEC group, it is also found in
enteropathogenic E. coli (EPEC), including serotype
O55:H7.

Finally, probes have been selected on a plasmid
of 60 MDa encoding, inter alia, enterohaemolysin, a
virulence factor also present in many EHECs (Levine et
al., J. Infect. Dis. 156, 1987, 175-182; Schmidt et
al., Infect. Immun. 63, 1995, 1055-1061). Thus, patent
US 5,475,098 relates to the nucleic sequences contained
in the enterohaemolysin operon, corresponding to the
hlyA and hlyB genes and to the hlyA-hlyB intergenic
region. The claimed oligonucleotide sequences allow
specific detection of the EHECs, but the invention does
not make it possible to differentiate E. coli O157:H7
from the other EHECs. Moreover, patent US 5,652,102
describes a nucleic sequence situated on a restriction
fragment derived from the plasmid of 60 MDa. However,
the use of oligonucleotides derived from this sequence
in a polymerase chain reaction (PCR) does not allow, on
its own, the specific identification of serotype
O157:H7, and consequently requires the joint use of
primers amplifying the genes encoding verotoxins and
intimin.

The plasmid pO157, isolated from an E. coli
O157:H7 strain obtained from a clinical sample, has
recently been described in its entirety (Makino et al.,
DNA Research 5, 1998, 1-9). Mapping of the plasmid,
representing the order of the different genes on the
plasmid genome, indicates the presence of 186 open
reading frames (ORFs). However, in the absence of data
on the nucleic sequence (not available at the time the
article was published), it is in no case possible to
identify a region of diagnostic interest for the
specific detection of E. coli O157:H7.

More recently, patent application WO 97/32045
described oligonucleotides selected from a chromosomal
sequence obtained by the RAPD (Random Amplified
Polymorphic DNA) method, leading to the detection of
about 99.5% of E. coli O157:H7; however, the claimed
nucleic sequences also detect nearly 3% of non-EHEC
E. coli, which is not satisfactory in terms of
specificity, in particular in the agri-foodstuffs
sector.

The main disadvantage of all these detection
systems is therefore that none of them makes it
possible to establish clearly and simply the
identification of the E. coli O157:H7 serotype. It
is indeed very often necessary to combine several
amplification and/or detection systems in order to make
the result accurate. The protocols used are then
difficult to carry out (multiple, simultaneous
amplifications) and the results obtained, as regards
sensitivity and specificity, are highly dependent not
only on the nucleic targets used, but also on the
operating conditions.

However, as indicated above, this serotype
can cause serious syndromes which can lead to death,
which calls for rapid and reliable means of detection,
in particular in the event of an epidemic.

The work by the inventors consisted in searching
an E. coli O157:H7 genomic library for specific
sequences allowing the recognition of the principal
E. coli serotypes pathogenic for humans, and more
particularly O157:H7. The library was screened against
enteropathogenic E. coli O55:H7, the presumed ancestor
of serotype O157:H7, the two genomes being extremely
closely related according to the polymorphism analyses
carried out by T. Whittam et al. in Infect. Immun. 61,
1993, 1619-1629.

This work made it possible to isolate two
nucleic fragments of interest for the detection of
EHECs, and more particularly for the detection of
E. coli O157:H7, comprising the nucleic sequences SEQ
ID No. 1 and SEQ ID No. 2, situated on the
enterohaemolytic plasmid of 60 MDa. The corresponding
clones pDF3 and pDF4 containing these sequences were
deposited at the Collection Nationale de Cultures de
Microorganismes of the Institut Pasteur, under the
numbers I-1999 and I-2000 respectively, on 26 March
1998.

Surprisingly, a first sequence (SEQ ID No. 1)
has been identified which comprises the stable
combination of a portion of the insertion sequence IS91
and of the sequence derived from the E. coli O157:H7
katP gene or of a portion thereof, the resulting
nucleic chain, never described elsewhere, being
specifically found in E. coli O157:H7.

The katP gene, encoding a catalase-peroxidase,
is present on the enterohaemolytic plasmid of E. coli
O157:H7 and of numerous EHECs (Brunder et al.,
Microbiol. 142, 1996, 3305-3315), whereas the insertion
sequence IS91, identified on the α-haemolytic plasmids
of E. coli (Zabala et al., J. Bacteriol. 151, 1982,
472-476), has never before been described in E. coli
O157:H7 type enterohaemolytic strains.

The identification of a truncated insertion
sequence at the IS91-katP junction (absence of the
left inverted repeat sequence (IRL) of IS91) also
suggests a stable integration of IS91 into this
portion of the E. coli O157:H7 genome.

Analysis of the amplified products of a large
number of O157:H7 strains of various origins
demonstrates the conservation of this nucleotide chain
within the O157:H7 serotype.

Indeed, an amplified product of 670 base pairs
was observed in all the strains tested (55 E. coli
O157:H7 and 1 E. coli O157:H-) with the primers SEQ ID
No. 3 and SEQ ID No. 4, situated respectively in the
sequences IS91 and katP.

Moreover, the data obtained from the AluI and
RsaI restriction profiles produced on the amplified
products of 5 O157:H7 strains of different origins, as
well as the sequence analysis performed on 3 strains,
including 2 isolated from epidemics (USA, 1993 and
Japan, 1996), showed perfect conservation, that is to
say 100% homology, in the sequence portion analysed
(SEQ ID No. 1: positions (nt) 272 to 624).

Furthermore, the stable and conserved
nucleotide chain probably results from a relatively old
recombination event which occurred in E. coli O157:H7
during evolution, since strains isolated in different
places and at different periods exhibit this same
characteristic.

This sequence therefore represents a preferred
target for the specific detection of serotype O157:H7.

A second sequence (SEQ ID No. 2) was characterized
on the same plasmid, associated with the presence of
the virulence factors enterohaemolysin (ehly) and
intimin (eae), characters which are specific to the
enterohaemorrhagic E. coli strains, including serotype
O157:H7. In this regard, this fragment is of
epidemiological interest because, unlike the methods
already known, which require the use of several
molecular systems (Paton and Paton, J. Clin. Microbiol.
36, 1998, 598-602), the use of this sequence for a
diagnostic application results in simplified use and
in more rapid interpretation of the results.

The subject of the present invention is
therefore the nucleic sequences SEQ ID No. 1 and SEQ ID
No. 2, their complementary sequences, the sequences
derived therefrom and the fragments which can be used
for the specific detection of EHECs in a food,
clinical, veterinary or environmental sample.

The subject of the present invention is more
particularly a specific sequence for the detection of
serotype E. coli O157:H7, comprising the sequence SEQ
ID No. 1, a fragment of this sequence or a sequence
derived therefrom.

According to the invention, the sequence SEQ ID
No. 1 comprises a nucleotide chain resulting from a
stable recombination event between the sequence of the
katP gene or a portion thereof and a truncated
insertion sequence IS91.

According to the present invention, the
expression nucleic sequence is understood to mean
either the DNA or complementary DNA (cDNA) sequence, or
alternatively the corresponding RNA sequence.

The invention also relates to the nucleic
sequences derived from SEQ ID No. 1 or SEQ ID No. 2,
that is to say the sequences differing by mutation,
insertion, deletion and/or substitution of one or more
bases but nevertheless hybridizing, under conditions of
high stringency, with one of the abovementioned
sequences.

According to the invention, the expression high
stringency is understood to mean temperature and ionic
strength conditions such that they allow specific
hybridization between two complementary nucleic acid
fragments and limit nonspecific attachments
(Sambrook et al., Molecular Cloning, Second Edition
(1989), 9.47-9.62). The temperature conditions are
generally between (Tm minus 5°C) and (Tm minus 10°C)
when one of the hybrid sequences is short (about twenty
nucleotides), Tm being the theoretical melting
temperature, defined as the temperature at which
50% of the paired strands separate.
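
As a rough illustration of the temperature window defined above,
the following sketch estimates Tm for a short oligonucleotide and
returns the interval from Tm minus 10°C to Tm minus 5°C. The
Wallace rule used here (2°C per A or T, 4°C per G or C) is an
assumption made for illustration only; the text does not state how
Tm is to be calculated, and the example 20-mer is hypothetical.

```python
def wallace_tm(oligo: str) -> int:
    """Estimate Tm in deg C with the Wallace rule: 2*(A+T) + 4*(G+C).

    Assumption: this rule of thumb is only meant for short oligos
    (about twenty nucleotides); it is not taken from the patent text.
    """
    seq = oligo.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("oligo must contain only A, C, G and T")
    return 2 * at + 4 * gc


def hybridization_window(oligo: str) -> tuple:
    """Return (Tm - 10, Tm - 5), the window described in the text."""
    tm = wallace_tm(oligo)
    return tm - 10, tm - 5


# Hypothetical 20-mer, not one of the sequences of the invention.
EXAMPLE_OLIGO = "ACGTACGTACGTACGTACGT"
low, high = hybridization_window(EXAMPLE_OLIGO)
print(f"Tm ~ {wallace_tm(EXAMPLE_OLIGO)} C, hybridize between {low} and {high} C")
```

In practice the ionic strength of the buffer also shifts Tm, which
this simple estimate ignores.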

The expression nucleic sequence derived from
SEQ ID No. 1 is also understood to mean, according to
the invention, any sequence differing from the latter
by mutation, insertion, deletion and/or substitution of
one or more bases and comprising a chain resulting from
stable recombination between the katP gene and the
truncated insertion sequence IS91.

More particularly, the nucleic sequences
contain at least 8, preferably 10, or most preferably
14 consecutive nucleotides of the chain of Figure 1,
and comprise the nucleotides from position 400 to
position 407.
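
The two conditions stated above (a minimum number of consecutive
nucleotides, and inclusion of positions 400 to 407) can be checked
on a pair of coordinates, as in the sketch below. It assumes
1-based, inclusive positions on SEQ ID No. 1 and accepts
coordinates given in either order; the function name is
hypothetical.

```python
JUNCTION_START = 400  # first position that must be included (from the text)
JUNCTION_END = 407    # last position that must be included (from the text)
MIN_LENGTH = 8        # minimum number of consecutive nucleotides (from the text)


def covers_junction(start: int, end: int, min_length: int = MIN_LENGTH) -> bool:
    """True if the fragment is long enough and spans positions 400-407.

    Assumption: positions are 1-based and inclusive; a descending
    pair (start > end) is treated as the same interval.
    """
    low, high = sorted((start, end))
    length = high - low + 1
    return (
        length >= min_length
        and low <= JUNCTION_START
        and high >= JUNCTION_END
    )


print(covers_junction(397, 410))  # interval listed for SEQ ID No. 18
print(covers_junction(291, 321))  # interval lying entirely before the junction
```

Passing `min_length=10` or `min_length=14` applies the preferred
thresholds instead of the minimum one.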

The subject of the present invention is also a
second sequence, specific for EHECs, which is SEQ ID
No. 2, the sequences complementary thereto, the
fragments thereof and the sequences derived therefrom,
these sequences being always detected in EHECs, in
particular in EHECs jointly possessing the genes
encoding enterohaemolysin (ehly) and intimin (eae);
said sequence SEQ ID No. 2 being represented in
Figure 2.

The invention also relates to oligonucleotide
fragments derived from the sequences SEQ ID No. 1 and
SEQ ID No. 2, which can be used as primers in an
amplification procedure or as probes in a method of
detection, comprising at least 8, advantageously at
least 10, more advantageously 14 nucleotides, and
preferably up to 30 consecutive nucleotides of the
nucleotide chain of SEQ ID No. 1 or SEQ ID No. 2, said
primers being capable of hybridizing with the said
sequences under high stringency conditions, as defined
above.

The primers or probes of the invention also
comprise oligonucleotides which can be modified by
substitution and/or addition and/or deletion of
several nucleotides, or by the addition at one of the
ends (generally at the 5' end for the primers; at the
3' or 5' end for the probes) of a nucleic sequence
which is foreign to the desired sequence, or
alternatively of a labelling molecule, the said
oligonucleotides being nevertheless capable of
hybridizing under high stringency conditions with
complementary nucleic sequences present in
E. coli O157:H7 or in the EHECs.

According to a preferred embodiment of the
invention, the oligonucleotides may be used as primers
in a gene amplification procedure, leading to the
production of a large number of copies of a fragment
of SEQ ID No. 1 or of a fragment of SEQ ID No. 2 and
allowing respectively the specific detection of
E. coli O157:H7 or of the EHECs.

The amplification step may be carried out by
any conventional method of enzymatic amplification of
DNA or RNA, such as in particular the TAS
(Transcription-based Amplification System) technique
proposed by Kwoh et al. in PNAS 86, 1989, 1173-1177,
the 3SR (Self-Sustained Sequence Replication)
technique described by Fahy et al. in PCR Meth. Appl.
1, 1991, 25-33, the NASBA (Nucleic Acid Sequence-Based
Amplification) technique described in patent
EP 329 822, or alternatively the SDA (Strand
Displacement Amplification) technique described by
Walker et al. in PNAS 89, 1992, 392-396, or
advantageously the PCR technique as described in
particular in European patents EP 200 362 and
EP 201 184 granted in the name of Cetus, or
alternatively the techniques derived from the latter
and any other method suitable for amplifying nucleic
sequences in vitro.

In a preferred embodiment of the invention, the
oligonucleotides derived from the sequences SEQ ID
No. 1 and SEQ ID No. 2 are used in PCR.

The detection of the amplified products may be
carried out by gel electrophoresis of all or part of
the reaction medium in which the amplification was
carried out, in particular on agarose or polyacrylamide
gel, or by capillary electrophoresis or chromatography.
Visualization of a band of nucleic fragments localized
at a specific point on the gel makes it possible to
assess their size, and the intensity of this band can
be roughly correlated with the number of initial
copies of the target to be detected in the sample.

According to another embodiment of the
invention, the oligonucleotides, as defined above, may
be used as probes in a hybridization procedure for the
direct detection of a target nucleic sequence or, after
amplification, for the detection of the amplified
products.

By way of illustration, the nucleotide
fragments may be labelled with a radioactive element
(for example 32P, 35S, 3H, 125I) or with a
nonradioactive molecule, in particular biotin,
acetylaminofluorene, a fluorochrome or digoxigenin, or
with an enzymatic molecule, or a hapten. Examples of
nonradioactive labellings of probes are described, for
example, in French patent No. 78.10975 by P. Kourilsky,
or by M.S. Urdea et al., Nucleic Acids Symp. Ser. 24,
1991, 197-200, or alternatively by R. Sanchez-Pescador,
J. Clin. Microbiol. 26, 1988, 1934-1938.

The most general hybridization method consists
in immobilizing the nucleic acid extracted from the
sample to be analysed on a support (nitrocellulose,
nylon, polystyrene and the like) and in incubating the
immobilized nucleic acid with the probe under defined
temperature and ionic strength conditions. After
hybridization, the excess probe is removed and the
hybridized molecules formed are detected by the
appropriate method (measurement of the radioactivity,
of the fluorescence or of the enzymatic activity linked
to the probe).

The hybrid molecules formed may also be
detected without it being necessary to separate the
"bound" and "unbound" phases. The detection is then
said to be carried out in a homogeneous phase. These
methods, as described by T. Walker et al., Clin.
Chemistry 42, 1996, 9-13 and L. Morrison (Nonisotopic
DNA Probe Techniques, Academic Press, 1992, 312-352),
relate in particular to fluorescence polarization, in
which a probe is labelled with fluorescein and where
the hybridization causes a modification of the
fluorescence, or alternatively to energy transfer. In
the latter case, the detection is based on inter- or
intramolecular interactions between two markers. A
first marker called "donor" is excited by absorption of
light at a particular wavelength. The energy is
transferred to a second marker called "acceptor", which
in turn is excited and emits energy.
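
To give an idea of why energy transfer reports on hybridization,
the sketch below evaluates the standard Förster relation
E = 1 / (1 + (r / R0)^6), in which the transfer efficiency falls
steeply as the donor-acceptor distance r exceeds the Förster
radius R0. The text does not give this formula; it is supplied
here as an assumption, and the distances used are hypothetical.

```python
def fret_efficiency(distance_nm: float, forster_radius_nm: float) -> float:
    """Energy transfer efficiency from the Forster relation.

    Assumption: the donor/acceptor pair behaves as an ideal Forster
    pair; forster_radius_nm is the distance at which E equals 0.5.
    """
    if distance_nm < 0 or forster_radius_nm <= 0:
        raise ValueError("distance must be >= 0 and Forster radius > 0")
    return 1.0 / (1.0 + (distance_nm / forster_radius_nm) ** 6)


# Hypothetical values: markers brought close together by hybridization
# versus markers held far apart.
for r in (2.5, 5.0, 10.0):
    print(f"r = {r} nm -> E = {fret_efficiency(r, 5.0):.3f}")
```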



The oligonucleotide probes may also be used in 
a detection device comprising an array arrangement of 
oligonucleotides in which oligonucleotides of a given 
length are attached in a predetermined order onto a 
support and overlap with each other by one or more 
bases; each oligonucleotide being complementary to a 
DNA or RNA sequence of the target sequence to be 
detected. The target sequence, which is advantageously 
labelled, is brought into contact with the array device 
and can hybridize with the probes attached to the 
support. An enzymatic treatment then makes it possible 
to eliminate the incomplete hybrids. Knowing the 
sequence of a probe at a determined position of the 
array, it is thus possible to deduce the nucleotide 
sequence of the target sequence analysed and to deduce 
the possible mutations which have occurred. 
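
The principle of the array described above can be sketched as
follows: probes of a fixed length are tiled along the target so
that neighbouring probes overlap, each probe being complementary
to one window of the target, and the target is then read back from
the ordered probes that formed a hybrid. This is a simplified
illustration under stated assumptions (perfect hybrids only, every
probe position known, a hypothetical target sequence); it does not
model the enzymatic step or mismatches.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def tiled_probes(target: str, length: int, step: int = 1) -> list:
    """Probes complementary to successive windows of the target.

    Neighbouring probes overlap by (length - step) bases.
    """
    if not 1 <= step <= length <= len(target):
        raise ValueError("need 1 <= step <= length <= len(target)")
    last_start = len(target) - length
    return [
        reverse_complement(target[i:i + length])
        for i in range(0, last_start + 1, step)
    ]


def reconstruct(probes: list, step: int = 1) -> str:
    """Rebuild the target from probes listed in array order."""
    windows = [reverse_complement(p) for p in probes]
    return windows[0] + "".join(w[-step:] for w in windows[1:])


TARGET = "ATGGCGTACGTTAGC"  # hypothetical target, not from the patent
PROBES = tiled_probes(TARGET, length=8)
print(len(PROBES), reconstruct(PROBES) == TARGET)
```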

An alternative to the use of a labelled 
sequence may consist of the use of a support allowing a 
"bioelectronic" detection of the hybridization of the 
target sequence with the probes attached onto the 
support of a material, such as gold, capable of acting, 
for example, as electron donor at the positions of the 
array in which a hybrid is formed. The detection of the 
target nucleic sequence is then determined by an 
electronic device. An exemplary embodiment of a 
biosensor is described in patent application EP-0721 
016 in the name of Affymax Technologies. 

According to a simple and advantageous 
embodiment, the nucleic probes may be used as capture 
probes. In this case, the probe termed "capture probe" 
is immobilized on a support and serves to capture, by 
specific hybridization, the target nucleic acid from 
the sample to be tested. If necessary, the solid 
support is separated from the sample and the duplex 
formed between the capture probe and the target nucleic 
acid sequence is then detected by means of a second 
probe termed "detection probe", labelled with a 
detectable element. Advantageously, the capture and
detection probes are complementary to two different
regions inside the target sequence (amplified or
otherwise) to be detected.

The attachment of the capture probe onto the
solid support may be carried out according to methods
well known to a person skilled in the art, in
particular by passive adsorption or by covalent
coupling (Cook et al., Nucleic Acids Res. 16, 1988,
4077-4095; Nagata et al., FEBS Lett. 183, 1985,
379-382; M. Longlaru et al., EP 420 260 A2;
T. Gingeras et al., EP 276 302; E. Homes and
L.M. Kornes, EP 446 260).

The hybridization of the capture and detection
probes may occur separately (in two stages) or
simultaneously (at the same time), in particular
according to one of the methods described by Langhale
and Malcolm, Gene 36, 1985, 201-210, by Ranki et al.,
Gene 21, 1993, 77-85, by Dunn and Hassel, Cell 12,
1977, 23-36, or alternatively by Ranki and Soderlund in
patents US 4,486,539 and US 4,563,419.

The subject of the present application is
therefore also oligonucleotides derived from SEQ ID
No. 1 or from SEQ ID No. 2, selected as primers or
probes, capable of hybridizing, under stringent
conditions, with a target nucleic acid sequence
contained in the tested sample and specific for
E. coli O157:H7 or EHECs.

The expression target nucleic sequence is
understood to mean any DNA or cDNA or RNA molecule
capable of hybridizing under high stringency conditions
with an oligonucleotide according to the invention.

The preferred oligonucleotides, whose sequences
are specified in the annex, correspond to the positions
on the sequences SEQ ID No. 1 and SEQ ID No. 2 reported
in the table below:

Sequence          Position (in SEQ ID No. 1 or SEQ ID No. 2)

SEQ ID No. 3:        9 - 30
SEQ ID No. 4:      679 - 658
SEQ ID No. 5:        6 - 30
SEQ ID No. 6:      682 - 658
SEQ ID No. 7:      241 - 263
SEQ ID No. 8:       47 - 69
SEQ ID No. 9:      251 - 274
SEQ ID No. 10:     426 - 401
SEQ ID No. 11:     427 - 402
SEQ ID No. 12:     391 - 421
SEQ ID No. 13:     387 - 417
SEQ ID No. 14:     291 - 321
SEQ ID No. 15:     510 - 540
SEQ ID No. 16:     331 - 350
SEQ ID No. 17:      68 - 87
SEQ ID No. 18:     397 - 410
SEQ ID No. 19:     396 - 411
SEQ ID No. 20:     395 - 412
SEQ ID No. 21:     718 - 739
SEQ ID No. 22:    1099 - 1078
SEQ ID No. 23:      41 - 60
SEQ ID No. 24:     884 - 863
SEQ ID No. 25:     928 - 958
SEQ ID No. 26:     970 - 1000
SEQ ID No. 27:     883 - 903
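
Some entries of the table are given with descending coordinates
(for example 679 - 658). The sketch below shows one way of turning
a pair of coordinates into an oligonucleotide, assuming that
positions are 1-based and inclusive and that a descending pair
denotes the reverse-complement strand. That reading of the table
is an interpretation rather than something the text states, and
the template is a hypothetical toy sequence, since SEQ ID No. 1
and SEQ ID No. 2 are not reproduced here.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def oligo_length(start: int, end: int) -> int:
    """Number of nucleotides covered by 1-based inclusive coordinates."""
    return abs(end - start) + 1


def oligo_from_positions(template: str, start: int, end: int) -> str:
    """Extract an oligo from 1-based inclusive coordinates.

    Assumption: start > end means the oligo is read on the
    complementary strand (reverse complement of the template window).
    """
    if min(start, end) < 1 or max(start, end) > len(template):
        raise ValueError("coordinates fall outside the template")
    if start <= end:
        return template[start - 1:end].upper()
    return reverse_complement(template[end - 1:start])


TEMPLATE = "AACCGGTTAC"  # hypothetical toy template
print(oligo_from_positions(TEMPLATE, 1, 4))  # forward orientation
print(oligo_from_positions(TEMPLATE, 4, 1))  # reverse orientation
print(oligo_length(682, 658))                # length implied by "682 - 658"
```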



The invention also relates to oligonucleotide
pairs, as described above, which can be used as primers
for the amplification of a target nucleic sequence
corresponding to SEQ ID No. 1 or SEQ ID No. 2,
contained in the genome of E. coli O157:H7 or of the
EHECs.

The preferred pairs of primers are the
following:

• for the amplification of E. coli O157:H7:

- SEQ ID No. 3 and SEQ ID No. 4

- SEQ ID No. 5 and SEQ ID No. 6

- SEQ ID No. 6 and SEQ ID No. 7

- SEQ ID No. 6 and SEQ ID No. 8

- SEQ ID No. 6 and SEQ ID No. 9

• for the amplification of the EHECs:

- SEQ ID No. 21 and SEQ ID No. 22

- SEQ ID No. 23 and SEQ ID No. 24

Thus, the use of the pair of primers SEQ ID No.
5 and SEQ ID No. 6 for carrying out the amplification
of the nucleic acid of E. coli O157:H7 leads to the
amplification of a nucleic fragment of 676 bp,
characteristic of the E. coli O157:H7 strains. The
specificity of this fragment may be checked, where
appropriate, by the use of the probe SEQ ID No. 18.

Likewise, the use of the pair of primers SEQ ID
No. 21 and SEQ ID No. 22 specifically amplifies a
nucleic sequence of 382 bp, present in EHECs which
also possess the enterohaemolysin and intimin
characters. The bacteria belonging to the other E. coli
groups, such as the ETECs (enterotoxin-producing
E. coli), the EPECs and the like, are not detected. The
specificity of the amplified product can moreover be
confirmed with the aid of oligonucleotide probes
internal to the amplified fragment, such as SEQ ID
No. 27.

The subject of the present invention is also
oligonucleotides, as described above, which can be used
as probes for the detection of an optionally amplified
nucleotide sequence. For example, the oligonucleotide
sequences SEQ ID No. 14, SEQ ID No. 15 and
SEQ ID No. 18 may be used for the specific detection of
E. coli O157:H7. Likewise, the use of the sequences
SEQ ID No. 25, SEQ ID No. 26 and SEQ ID No. 27 allows
the detection of EHECs including O157:H7.

The subject of the invention is also the
plasmids containing the sequences SEQ ID No. 1 and
SEQ ID No. 2 mentioned above, as well as the host cells
containing them.

The invention also relates to a method for the
in vitro detection of E. coli O157:H7 or EHECs in a
sample, characterized in that it comprises the
following steps:

1. bringing the sample into contact with one of
the pairs of primers, as described above, the nucleic
acid contained in the sample having been, where
appropriate, made accessible to the hybridization of
the primers with the nucleic acid of the target tested
for,

2. amplifying the nucleic sequence flanked by
the pair of primers chosen,

3. verifying the possible presence of the
amplified product according to a method known to
persons skilled in the art, as described above.
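
The notion of a "nucleic sequence flanked by the pair of primers"
in step 2 can be illustrated with a small in-silico sketch: the
forward primer is located on the template, the reverse primer is
located through its reverse complement, and the stretch between
them (primers included) is the expected amplified product. The
sketch assumes exact matches on a single strand and uses a
hypothetical toy template and primers; it is not a model of the
actual amplification conditions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def in_silico_pcr(template: str, forward: str, reverse: str):
    """Return the product flanked by the primer pair, or None.

    Assumption: primers must match the template exactly; real
    primers tolerate some mismatches depending on stringency.
    """
    template = template.upper()
    forward = forward.upper()
    fwd_at = template.find(forward)
    if fwd_at < 0:
        return None
    reverse_site = reverse_complement(reverse)
    rev_at = template.find(reverse_site, fwd_at + len(forward))
    if rev_at < 0:
        return None
    return template[fwd_at:rev_at + len(reverse_site)]


# Hypothetical toy data, not sequences of the invention.
TEMPLATE = "TTGACCATGCAAGGCTTACGGATCCAGTTCGAAT"
FORWARD = "GACCATGC"
REVERSE = "CGAACTGG"

product = in_silico_pcr(TEMPLATE, FORWARD, REVERSE)
print(product, len(product))
```

A sample lacking either primer site yields `None`, which
corresponds to the absence of an amplified product in step 3.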

According to an advantageous embodiment, the
amplified fragments may be detected according to the
principle of the so-called "sandwich" method.

Also falling within the scope of the invention
is a method for the in vitro detection of previously
amplified nucleotide sequences specific for
E. coli O157:H7 or EHECs, by detection on a support,
for example a microtiter plate, characterized in
that it comprises the following steps:

- denaturation of the amplified sequence of
E. coli O157:H7 and/or EHECs by a physical or chemical
means; the addition of a denaturing solution composed
of 200 mM NaOH, 40 mM EDTA will be preferred,

- bringing the denatured amplified fragments into
contact, in an appropriate hybridization buffer, with,
on the one hand, at least one capture probe attached to
the support and, on the other hand, at least one
detection nucleic probe free in the hybridization
buffer, optionally labelled, capable of hybridizing
with the same strand of the amplified fragments as that
with which the capture probe is hybridized, but in a
region different from that hybridized with the capture
probe; it being possible for the said hybridization
solution to be advantageously 5-fold concentrated SSPE
(Sodium Saline Phosphate EDTA; Molecular Cloning, A
practical guide, Sambrook et al., Vol. 3, 1989, annexe
B13), 0.5% Tween 20, 0.01% Merthiolate,

- incubation of the reaction mixture for a
sufficiently long period to allow the hybridization; it
being possible for this incubation, for example, to be
advantageously performed at 37°C for about 1 hour,

- one or more washings of the preceding
mixture, in order to remove any unreacted nucleic
sequence; it being possible for the said washings, for
example, to be carried out with a solution containing
10 mM Tris-HCl, 300 mM NaCl and 0.1% Tween 20, pH 7.4,

- visualization of the detection probes
hybridized with the amplified nucleic sequences.

According to an advantageous embodiment of the
invention, the detection probe is labelled with
peroxidase, and the activity of the peroxidase linked
to the hybridized detection probe is visualized by
colorimetric reading, in the presence of a chromogenic
substrate, according to the following steps:

- deposition of a solution containing a
chromogenic substrate, such as tetramethylbenzidine
(TMB), in each of the wells containing the reaction
mixture, and incubation of the microplate in the dark
for a sufficient period, generally 20 to 30 min, after
which the reaction is stopped by the addition of a
blocking solution, the said solution being
advantageously an H2SO4 solution used at a final
concentration of 0.5 N,

- determination of the optical density, the
said determination being carried out at a wavelength of
450 nm (reference 620 nm) when TMB is used as
chromogenic substrate.
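
The reading step can be sketched numerically as follows. The code
assumes that "reference 620 nm" means a dual-wavelength reading in
which the absorbance at 620 nm is subtracted from the absorbance
at 450 nm, and it compares the result with a cut-off. Both the
subtraction and the cut-off value are assumptions made for
illustration; the text does not specify how a positive well is
called.

```python
def corrected_od(a450: float, a620: float) -> float:
    """Optical density at 450 nm corrected by the 620 nm reference.

    Assumption: the reference reading is simply subtracted.
    """
    return a450 - a620


def is_positive(a450: float, a620: float, cutoff: float) -> bool:
    """True if the corrected optical density exceeds the cut-off.

    The cut-off is hypothetical and would in practice be derived
    from negative controls run on the same plate.
    """
    return corrected_od(a450, a620) > cutoff


# Hypothetical well readings: (A450, A620)
wells = {"sample 1": (1.250, 0.050), "negative control": (0.120, 0.050)}
for name, (a450, a620) in wells.items():
    print(name, round(corrected_od(a450, a620), 3), is_positive(a450, a620, 0.2))
```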

According to a particularly advantageous
embodiment, the capture probe used for the detection of
E. coli O157:H7 may be SEQ ID No. 15 and the detection
probe the oligonucleotide SEQ ID No. 18. Likewise,
the capture probe used for the detection of EHEC
bacteria may be SEQ ID No. 25 and the detection probe
the oligonucleotide SEQ ID No. 27.

The invention also relates to a detection kit
for the identification of E. coli O157:H7 or EHECs
contained in a sample, comprising among the reagents:

- at least two oligonucleotides as defined
above, used as primers for the amplification of
E. coli O157:H7 or of the bacteria of the EHEC group,

- optionally, a component for verifying the
sequence of the amplified fragment, more particularly a
nucleic probe as defined above.

The following examples are given without
limitation to illustrate the invention.

EXAMPLE 1

Characterization of the sequences SEQ ID No. 1 and
SEQ ID No. 2

1) Construction of the E. coli O157:H7 genomic
library:

The genomic DNA of the E. coli O157:H7 strain,
isolated from the stools of a patient suffering from
haemorrhagic colitis and producing type 1 and 2
verotoxins, was partially digested with the
endonuclease PstI (Boehringer Mannheim, Ref. 621625) by
allowing 0.03 enzyme unit to act per µg of DNA in
buffered medium for 1 hour at 37°C. The genomic DNA
thus digested generated fragments of 35-45 kb. The
cosmid pHC79 (Hohn and Murray, Proc. Natl. Acad. Sci.
74, 1977, 3259-3263) was digested in the same manner
and dephosphorylated so as to avoid any self-ligation.

The ligation was carried out by mixing 900 ng of vector and
2.6 µg of DNA fragments of 35-45 kb (that is, a
vector/insert molar ratio of 2), the reaction medium being
left at 14 °C for 18 hours after having been supplemented
with 2 units of T4 DNA ligase (Boehringer Mannheim,
Ref. 481220). The recombinant cosmids were encapsidated
in vitro and used to transform the E. coli XL1-Blue MR
bacteria (Stratagene, Ref. 200300). The transformed
bacteria were incubated for 1 hour at 37 °C in LB medium
(Luria-Bertani; Molecular Cloning, A practical guide,
Sambrook et al., Vol. 3, 1989, annexe A1). Since the DNA
fragments of 35-45 kb are inserted into the vector pHC79 so
as to abolish the ampicillin resistance site and to
conserve the tetracycline resistance site, the bacteria
were then plated on selective agar medium containing
12.5 µg/ml of tetracycline.
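
The stated vector/insert molar ratio can be checked from the masses given above. The following is only a minimal sketch: the size of pHC79 (taken here as about 6.4 kb) and the average mass of a base pair (650 g/mol) are assumptions that do not appear in the text, and the insert size is taken as the midpoint of the 35-45 kb range.

```python
# Approximate molar mass of one base pair of double-stranded DNA (g/mol).
# Textbook approximation, not a value taken from the text.
BP_MASS = 650.0

def picomoles(mass_ng: float, length_bp: float) -> float:
    """Convert a mass of double-stranded DNA (ng) to picomoles of molecules."""
    grams = mass_ng * 1e-9
    moles = grams / (length_bp * BP_MASS)
    return moles * 1e12

VECTOR_BP = 6_400    # assumed size of pHC79; not stated in the text
INSERT_BP = 40_000   # midpoint of the stated 35-45 kb range

vector_pmol = picomoles(900, VECTOR_BP)      # 900 ng of vector
insert_pmol = picomoles(2_600, INSERT_BP)    # 2.6 µg of insert fragments
molar_ratio = vector_pmol / insert_pmol

print(f"vector: {vector_pmol:.3f} pmol, insert: {insert_pmol:.3f} pmol")
print(f"vector/insert molar ratio: {molar_ratio:.1f}")
```

Under these assumptions the ratio comes out slightly above 2, which is consistent with the rounded value given in the text.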
Mini preparations of cosmid DNA were produced from the
first 360 colonies isolated on tetracycline, using the
REAL Prep96 Kit distributed by Qiagen (reference 26171).

The DNA of these preparations was then digested with the
endonucleases PstI, EcoRI and SalI (Boehringer Mannheim,
Ref. 621625, 703737 and 567663), analysed by
electrophoresis on 1.2% agarose gel and then transferred
onto Hybond N+ nylon filter (Amersham, Ref. RPN 303B). The
DNA was irreversibly fixed by exposure to UV for 5 min.

2) Screening of the library:

The hybridizations were carried out with a homologous DNA
probe obtained from the E. coli O157:H7 strain (Collection
de l'Institut Pasteur, No. 103571) and with a heterologous
DNA probe consisting of a "pool" of DNA obtained from
8 E. coli O55:H7 strains (Collection de l'Institut Pasteur,
No. 105215, 105216, 105217, 105228, 105239, 105240, 105241,
105242).

The various filters were hybridized for 16 to 18 hours at
65 °C in a solution containing 6-fold concentrated SSC
buffer (Sodium Saline Citrate; Molecular Cloning,
A practical guide, Sambrook et al., Vol. 3, 1989,
annexe B13), 5-fold concentrated Denhardt's solution
(Molecular Cloning, Vol. 3, 1989, annexe B15), 10% Dextran
sulphate (Pharmacia Biotech, Ref. 17-0340-02), 10 mM EDTA,
0.5% SDS, 100 µg/ml single-stranded salmon sperm DNA and
the relevant DNA (O157:H7).

After hybridization, the filters were washed twice for
10 min in 2-fold concentrated SSC buffer at 65 °C, once for
30 min in a buffer containing 2-fold concentrated SSC and
0.1% SDS at 65 °C and then once for 10 min in SSC diluted
1/10 at 65 °C. The filters, while still wet, were exposed
in a cassette with an intensifying screen for 24 to
48 hours at -80 °C.

After the necessary exposure time, the films were developed
and the nylon membranes were then dehybridized by
performing 4 to 5 bathing cycles at 45 °C, with stirring.
Each cycle consisted of two successive baths: 30 min in a
0.5 N NaOH solution and then 30 min in a buffer containing
SSC diluted 1/10 and 0.1% SDS. The membranes were finally
washed in 2-fold concentrated SSC and placed in a cassette
in order to verify that no traces of hybridization
remained. After dehybridization, the filters were
hybridized in the same manner as above with a pool of
nonrelevant DNA (O55:H7).

3) Isolation and cloning of the fragments SEQ ID No. 1 and
SEQ ID No. 2:

The results of these hybridizations made it possible to
identify two cosmid clones, from each of which one fragment
of about 1 to 2 kb, hybridizing with the homologous probe
and not hybridizing with the heterologous probe, was
isolated. After their conservation in various O157:H7
strains had been verified by "dot-blot" hybridization,
these fragments were cloned into a vector pUC18 (Oncor
Appligene, Ref. 161131) and then prepared in large
quantity. The recombinant plasmids were called pDF3 and
pDF4 and correspond respectively to the sequences
SEQ ID No. 1 and SEQ ID No. 2.

4) Determination of the sequences SEQ ID No. 1 and
SEQ ID No. 2

The fragments were sequenced according to the method of
Sanger et al. described in Proc. Natl. Acad. Sci. 74, 1977,
5463, using the "universal primer" and the "reverse primer"
of the plasmid pUC18, as well as oligonucleotides internal
to the sequences.

The sequence SEQ ID No. 1 (Figure 1), containing 1489 bp,
exhibits 99.9% homology with the katP gene of E. coli
O157:H7 in the region 407 to 1489 and 95.8% homology with
the IS91 of E. coli in the region 1 to 406.

The analysis of the sequence SEQ ID No. 2 (Figure 2),
containing 1181 bp, reveals no known sequence of the
enterohaemolytic plasmid. Only the portion 237 to 570
exhibits 68% homology with the virK plasmid gene encoding a
Shigella flexneri virulence protein.

EXAMPLE 2

Specific detection of E. coli O157:H7

The specificity study was performed on 100 E. coli strains
of different serotypes and 42 non-E. coli strains
comprising, inter alia, bacteria capable of cross-reacting
with E. coli O157:H7 such as, for example, Salmonella,
Shigella dysenteriae, Citrobacter freundii, Hafnia alvei
and Escherichia hermanii.

1) Extraction of the DNA:

The DNA is obtained by the method of boiling in the
presence of Chelex (InstaGene™ Matrix, Biorad). The samples
were prepared according to the following protocol:

A bacterial suspension is produced in sterile ultrapure
water from several bacterial colonies isolated on
Tryptone-Casein-soybean agar (Sanofi Diagnostics Pasteur,
Ref. 53455), then centrifuged at 10,000-12,000
revolutions/min for 2-3 min, and the supernatant is
carefully removed. The bacterial pellet is resuspended in
200 µl of lysis reagent, homogenized and then incubated in
a heating block at 100 °C for 10-15 min. The sample is
again homogenized and then centrifuged at 10,000-12,000
revolutions/min for 2-3 min. The DNA can be amplified
directly or stored at -20 °C.

2) Amplification by PCR:

The amplification reaction is carried out in a total volume
of 15 µl containing 50 mM KCl; 10 mM Tris-HCl pH 8.3;
0.01% gelatin; 3 mM MgCl2; 0.25 mM of each primer
SEQ ID No. 5 and SEQ ID No. 6; 100 µM (dATP, dCTP, dGTP);
400 mM dUTP; 0.5 unit of Uracil-DNA-Glycosylase (UDG; BRL
Life Technologies); one unit of Taq DNA Polymerase (BRL
Life Technologies) and 5 µl of DNA prepared as indicated in
paragraph 1.

After incubating at 50 °C for 2 min and then at 95 °C for
5 min, the samples are subjected to 35 amplification cycles
composed of 15 sec at 95 °C, 15 sec at 65 °C and 15 sec at
72 °C. The tubes are kept at 72 °C until the plate is
removed.

The thermal cycles are performed in a "Perkin-Elmer 9600"
thermocycler.

Each experiment comprises a positive control and a negative
control.

3) Visualization of the amplified products:

The amplification reactions are visualized on agarose gel
or detected on microplate.

3-1) Agarose gel:

After amplification, 15 µl of chloroform are added to each
sample in order to inactivate the UDG, and one aliquot of
each reaction is then analysed by electrophoresis on 1.2%
agarose gel stained with ethidium bromide, in the presence
of a size marker. Visualization of a DNA fragment at 676 bp
indicates the presence of E. coli O157:H7 in the sample
tested.
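
The size of the expected band follows from the positions at which the two primers anneal on the template. The sketch below illustrates that relationship on a made-up template and made-up primers; these are hypothetical strings chosen for readability, not the sequences of the invention, and only exact primer matches are considered.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))

def amplicon_length(template: str, forward: str, reverse: str):
    """Length in bp of the fragment bounded by an exactly matching primer pair.

    `forward` is read on the sense strand, `reverse` on the antisense strand
    (both written 5' to 3'). Returns None if either primer does not anneal.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start < 0:
        return None
    # The antisense primer anneals where its reverse complement occurs.
    site = template.find(reverse_complement(reverse), start)
    if site < 0:
        return None
    return site + len(reverse) - start

# Hypothetical 30-base template and 6-base primers, for illustration only.
toy_template = "AAACCCGGGTTTACGTACGTTTGACCATGA"
print(amplicon_length(toy_template, "CCCGGG", "ATGGTC"))
```

Real primer design tools also tolerate mismatches and check melting temperatures; this sketch deliberately ignores both.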

3-2) Hybridization in microplate:

The amplification products are denatured by addition,
volume for volume, of a solution containing 200 mM NaOH and
40 mM EDTA. The microplate, in which the surface of the
wells is coated with the capture probe SEQ ID No. 15, is
prehybridized in a hybridization buffer containing 5-fold
concentrated SSPE, 0.5% Tween 20 and 0.01% Merthiolate.
Next, the microplate is emptied and each of the wells
receives 200 µl of hybridization buffer containing the
denatured amplified fragment and the revealing probe
SEQ ID No. 18. The incubation is performed at 37 °C, with
stirring, for 1 hour.

The wells are then washed six times with 400 µl of solution
(10 mM Tris-HCl pH 7.4; 300 mM NaCl and 0.1% Tween 20), and
the activity of the peroxidase bound to the probe is then
detected by adding to each well 200 µl of a detection
solution containing the chromogen tetramethylbenzidine
(TMB). The microplate is incubated at 37 °C, in the dark,
for 30 min and then 100 µl of a 1.5 N H2SO4 solution are
added in order to block the reactions. The optical density
values are determined at 450 nm against a reference at
620 nm.
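
The volumes used here agree with the final H2SO4 concentration of 0.5 N indicated in the general description of the colorimetric reading. A short worked check, assuming simple additive volumes (the function name is illustrative):

```python
def final_normality(stock_normality: float, stock_ul: float, well_ul: float) -> float:
    """Normality after adding `stock_ul` of acid to `well_ul` already in the well."""
    return stock_normality * stock_ul / (stock_ul + well_ul)

# 100 µl of 1.5 N H2SO4 added to 200 µl of detection solution per well.
stop_normality = final_normality(1.5, 100, 200)
print(f"final H2SO4 concentration: {stop_normality:.2f} N")
```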
4) Study of specificity:

The tests were performed on a total of 142 bacterial
strains, using the pair of primers SEQ ID No. 5 and
SEQ ID No. 6 for the PCR amplification step, and the
capture probe SEQ ID No. 15 and the detection probe
SEQ ID No. 18 for the hybridization step on microplates.

The results obtained on microplates with the E. coli
strains and the non-E. coli strains (bacteria of different
genera and species) are presented respectively in Tables I
and II below:

Table I

E. coli strain     Number of         PCR
(serotype)         strains tested    SEQ ID No. 5/6

VTEC/EHEC
O157:H7            55                +
O157:H-            1                 +
O26:H11            10                -
O111:H-            2                 -
O145:H-            2                 -
O103:H2            2                 -
O121:H19           1                 -
O165:H25           1                 -
O45:H2             1                 -
O22:H8             2                 -
O137:H41           1                 -
O91:H21            1                 -
O141:H4            1                 -

EPEC
O55:H7             8                 -
O55:H6             1                 -
O55:H-             1                 -
O111:H-            1                 -
O111:H2            1                 -
O111:H12           1                 -
O128:H2            1                 -
O127:H6            1                 -

ETEC
O157:H19           1                 -
O159:H34           1                 -
CIP 81.86          1                 -

E. coli
CIP 76.24          1                 -
CIP 54.8           1                 -

The (+) results correspond to OD450 > 2.5.
The (-) results correspond to OD450 < 0.05.

Table II

Strain                         Number of         PCR
(bacterial species)            strains tested    SEQ ID No. 5/6

Salmonella
Salmonella (Groups I to VI)    10                -

Shigella
Shigella flexneri              2                 -
Shigella dysenteriae           1                 -
Shigella sonnei                1                 -

Others
Escherichia hermanii           2                 -
Citrobacter freundii           2                 -
Yersinia enterocolitica        2                 -
Yersinia pseudotuberculosis    1                 -
Hafnia alvei                   1                 -
Proteus mirabilis              1                 -
Proteus vulgaris               1                 -
Serratia marcescens            1                 -
Klebsiella pneumoniae          2                 -
Klebsiella oxytoca             1                 -
Enterobacter cloacae           1                 -
Enterobacter aerogenes         1                 -
Enterobacter agglomerans       1                 -
Bacillus subtilis              1                 -
Morganella morganii            1                 -
Providencia alcalifaciens      1                 -
Vibrio parahaemolyticus        1                 -
Acinetobacter baumanii         1                 -
Shewanella putrefaciens        1                 -
Pseudomonas aeruginosa         1                 -
Pseudomonas fluorescens        1                 -
Listeria monocytogenes         3                 -

The (+) results correspond to OD450 > 2.5.

The (-) results correspond to OD450 < 0.05.
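
The (+) and (-) calls of Tables I and II can be expressed as a simple rule on the plate readings. The sketch below rests on two assumptions that are not stated in the text: that the 620 nm reference reading is subtracted from the 450 nm reading, and that a value falling between the two thresholds is reported as indeterminate. The function and constant names are illustrative.

```python
POSITIVE_ABOVE = 2.5    # OD450 threshold for a (+) result, from the table notes
NEGATIVE_BELOW = 0.05   # OD450 threshold for a (-) result, from the table notes

def call_well(od450: float, od620: float = 0.0) -> str:
    """Classify one microplate well from its optical density readings."""
    corrected = od450 - od620   # assumed reference correction
    if corrected > POSITIVE_ABOVE:
        return "+"
    if corrected < NEGATIVE_BELOW:
        return "-"
    return "indeterminate"      # assumed handling of intermediate values

for reading in [(2.90, 0.05), (0.06, 0.04), (1.00, 0.00)]:
    print(reading, call_well(*reading))
```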



In conclusion, only the O157:H7 and O157:H- strains are
detected on microplates with the abovementioned system.

EXAMPLE 3

Specific detection of EHECs

The specificity was tested on a total of 142 bacterial
strains including various serotypes of E. coli as well as
other bacterial species which can interfere with the
detection of the EHECs.

The DNAs were extracted according to the protocol described
in the first paragraph of Example 2.
The amplification conditions are the following:

The reaction is carried out in a total volume of 50 µl
containing 50 mM KCl; 10 mM Tris-HCl pH 8.3; 1.5 mM MgCl2;
0.5 of each primer SEQ ID No. 21 and SEQ ID No. 22; 200 µM
(dATP, dCTP, dGTP, dTTP); one unit of Taq DNA Polymerase
(BRL Life Technologies) and 5 µl of DNA prepared as
indicated in paragraph 1 of Example 2.

The thermal cycles are performed in a "Perkin-Elmer 9600"
thermocycler.

Each experiment comprises a positive control and a negative
control.

The amplification products were visualized on agarose gel
stained with EtBr, the presence of a band at 382 bp
indicating the presence of EHEC in the sample tested.

The results are presented in Table III.

Only the strains exhibiting the ehly and eae characters,
virulence factors frequently associated in the strains
isolated from human infections, are detected by PCR with
the pair of primers SEQ ID No. 21 and SEQ ID No. 22.

Furthermore, the use of the said pair of primers also makes
it possible to detect in particular, by means of a single
amplification reaction, the E. coli strains possessing the
genotype (vt+, eae+ and ehly+) characteristic of the
enterohaemorrhagic E. coli.

Table III

Strain         Number of         Genotype              PCR
(serotype)     strains tested                          SEQ ID No. 21/22

VTEC/EHEC
O157:H7        54                vt+, ehly+, eae+      +
               1                 vt-, ehly+, eae+      +
O157:H-        1                 vt+, ehly+, eae+      +
O26:H11        5                 vt+, ehly+, eae+      +
               1                 vt-, ehly+, eae+      +
O26:H-         1                 vt+, ehly+, eae+      +
O111:H-        1                 vt+, ehly+, eae+      +
O145:H-        2                 vt+, ehly+, eae+      +
O103:H2        2                 vt+, ehly+, eae+      +
O121:H19       1                 vt+, ehly+, eae+      +
O165:H25       1                 vt+, ehly+, eae+      +
O45:H2         1                 vt+, ehly+, eae+      +
O22:H8         2                 vt+, ehly+, eae-      -
O137:H41       1                 vt+, ehly+, eae-      -
O91:H21        1                 vt+, ehly+, eae-      -
O26:H11        3                 vt+, ehly-, eae+      -
O111:H-        1                 vt+, ehly-, eae+      -
O141:H4        1                 vt+, ehly-, eae-      -

EPEC
O55:H7         8                 vt-, ehly-, eae+      -
O55:H6         1                 vt-, ehly-            -
O55:H-         1                 vt-, ehly-, eae-      -
O111:H-        1                 vt-, ehly-, eae-      -
O111:H2        1                 vt-, ehly-, eae+      -
O111:H12       1                 vt-, ehly-, eae-      -
O128:H2        1                 vt-, ehly-, eae+      -
O127:H6        1                 vt-, ehly-, eae+      -

ETEC
O157:H19       1                 vt-, ehly-, eae-      -
O159:H34       1                 vt-, ehly-, eae-      -
CIP 81.86      1                 vt-, ehly-, eae-      -

E. coli
CIP 76.24      1                 vt-, ehly-, eae-      -
CIP 54.8       1                 vt-, ehly-, eae-      -

non-E. coli    42                                      -
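
The pattern of Table III, in which the amplification is positive only for strains carrying both the ehly and eae characters, whatever their vt status, can be written as a one-line predicate. This is a sketch of the rule as read from the table, with illustrative type and function names; it is not a description of the assay itself.

```python
from typing import NamedTuple

class Genotype(NamedTuple):
    vt: bool     # verotoxin gene(s) present
    ehly: bool   # enterohaemolysin character present
    eae: bool    # intimin character present

def expected_pcr_result(genotype: Genotype) -> str:
    """Expected result with primers SEQ ID No. 21/22, as read from Table III."""
    return "+" if genotype.ehly and genotype.eae else "-"

# A few genotype combinations that appear in Table III.
rows = {
    "vt+, ehly+, eae+": Genotype(True, True, True),
    "vt-, ehly+, eae+": Genotype(False, True, True),
    "vt+, ehly+, eae-": Genotype(True, True, False),
    "vt+, ehly-, eae+": Genotype(True, False, True),
    "vt-, ehly-, eae-": Genotype(False, False, False),
}
for label, genotype in rows.items():
    print(label, "->", expected_pcr_result(genotype))
```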













   1 CTGCAGTCCG GAGATGAAAG CACCACTGTG TGTACCCCAT CAGCGTGGTC
  51 CCGCAGGCCA TGATTTTTGT CACAGACTCA ATGACTACCG GACGCACTGA
 101 ACCTTCCGGT TGTTTCTCCA GCCAGTTAAG CCAGCGGTTT CCCTGCTGAA
 151 AAATGTCGGC AAAACGGGGA AGCATCAGAA GGGCGGGGGA ACTCCGTCCG
 201 GCCAGTGAAC CGTGCCACAC TCCGGGCAGT ACATGCCGCC GGCGCTGATA
 251 CCGGCAAGAA TGGTCGCAAA CTCCCGCTCC GTGCAGCGGG CTATTTCAGG
 301 ATACCCTTCG TCATCAACAC GTACAAACCA GAAGACCAGC TTTTTGTTTC
 351 TGACATCCAC AAAGAAGGGA ATATTCAGGT CTGCGCAGCA CTCAACGGCA
 401 TCGTCAGTTG CGGCTTGGAA CCCCTTAGTA TTTTTTGTCT GTAGTATCTA
 451 TCCCAGCAAT AGGTATATCC TGTTGCATCA ATAAAGTTGA CTTTTGTATA
 501 CAACATGCGA ATTTCCCTTA ATCCGGAGCT ATTCGTATGA TAAAAAAAAC
 551 TCTTCCTGTT CTGATTCTTC TGGCGCTATC GGGGAGCTTT TCTACCGCTG
 601 TAGCCGCTGA TAAAAAAGAG ACTCAAAATT TCTACTATCC AGAAACACTG
 651 GATTTAACTC CTCTGAGATT ACACAGCCCT GAATCAAATC CCTGGGGGGC
 701 TGATTTTGAT TATGCCACCA GATTTCAACA GCTGGATATG GAGGCTCTGA
 751 AAAAAGATAT CAAAGATTTG CTGACAACTT CCCAGGATTG GTGCCCTGCG
 801 GATTATGGTC ATTATGGTCC TTTCTTTATT CGTATGGCTT GGCACGGTGC
 851 CGGAACATAC AGGACATATG ATGGCCGGGG AGGCGCCAGT GGTGGTCAGC
 901 AACGTTTTGA ACCGCTGAAC AGCTGGCCGG ATAACGTTAA TCTGGATAAA
 951 GCCCGTCGAT TGCTGTGGCC AGTCAAGAAA AAATACGGCT CCAGTATTTC
1001 CTGGGGAGAC CTGATGGTCC TGACTGGTAA TGTTGCCCTT GAATCCATGG
1051 GATTTAAAAC GCTGGGATTT GCTGGCGGAA GAGAAGATGA CTGGGAGTCG
1101 GACCTGGTAT ACTGGGGGCC TGACAACAAG CCTCTTGCAG ATAACCGGGA
1151 TAAAAACGGG AAACTTCAGA AACCTCTTGC CGCCACGCAG ATGGGACTTA
1201 TTTATGTCAA TCCTGAAGGC CCCGGTGGAA AACCAGATCC TCTGGCTTCC
1251 GCGAAAGATA TCAGGGAAGC TTTTTCACGT ATGGCCATGG ATGATGAGGA
1301 GACTGTGGCC CTGATCGCGG GAGGGCATAC ATTTGGTAAA GCACATGGTG
1351 CAGCGTCTCC TGAAAAATGT ATTGGCGCAG GGCCTGATGG TGCACCTGTG
1401 GAGGAGCAGG GACTGGGATG GAAAAATAAA TGTGGTACAG GAAACGGCAA
1451 ATATACCATC ACCAGTGGCC TGGAAGGAGC CTGGTCGAC

Regions marked on the figure: IS91 (upstream part of the
sequence) and katP (downstream part of the sequence).

FIG. 1

   1 CTGCAGGAGA TGGAAAAAAA GCCAAAATAA AAAATTGCCC ATCCCAGCGC
  51 GCTCCAGCTG AAAGTAGGCC TGTTCTGTCC GGTATTTAAA TGCATTGACC
 101 GTCCCCGTAT TTAAACAATG TGATAAATTA CTCCGTTACC GGAAAACCGC
 151 TGAACAAAAT TCGGGCTGAA AAGAGGATCC GCCGTTATCT GTTGCATTTC
 201 CCCTTAGCCT GACTAGCCAC AGACACAATG ATCTGTGCCG TTCTGTTAAT
 251 ATCAAACCGG TACTCAATAT CTTCTCTGGC GCTGGCTGCC ATCATCCGGA
 301 AGCGTTCCGG TCGGGATAAA AAATCGCGCA GTGCGCCGGT CCATGCAGAC
 351 ACATCCCCCA CGGGTAACAG CGTCCCTGTC ACATTCTTCT GAATGACATC
 401 AGGGATCCCG CCCGTCTCAC TGGCGATAAC GGGCACGCCG GAGACTGACG
 451 CTTCAGCCAG TACCATACCA AACGCTTCAT TTTCCGAAGG CATGACCACC
 501 ACACTGGCAA TCCGGTAGAC CGGTAACGCT GGGAAAAGGG CACCTGCCAT
 551 TAACACATCT CCGCTCATTC CCAGGTGTTC TGTCTGCTGA CGCAGACGTG
 601 CTTCGTATTC TTCACGCCCG GCGCCCACCA CGAGCCAGCG AAATGATTTC
 651 CCTTCCATCT TCAGCTGATA CAATACACGC AGCATAAATT CATGTCCTTT
 701 TTCGGGACGT AGCATCCCCA CCTGAACGAT AAGCGGAACA TTGTCTGCTG
 751 ATGCAGCCCA GGCGTGGATA TGCAGGGGTA ACGGTCGCAT GGCTTCATTA
 801 TGCAATGCGG GCCAGTCGAA ACCCGGTGGA ATAACCGTTA CCGGTGTCCT
 851 GACACCTTCC GCCATCAGAT GCGCCATCAT GGGTGAGATA GGCACAACAA
 901 TGAAATCACA CAGATAATTC AGGGAAAACG TTCTGGTCTT ACGGGTGATG
 951 TAGGTTTTTT GTCTGACAAT AGTGAAGCGG TGACAGCATA TCAGACGGCT
1001 CAGTCCTGCT ATATTACTGT CATGGCCACT ATGGCAGATG ACCAGATCAG
1051 GTTTAAATTC CCCGATAATC CGTCGAAGTC TGAGGATGGA AGGAAGGTGA
1101 AGGCTGTTCC TGAAAGGAAT AAAAGTGACA TCATGCCCTC TTTTTCTGGC
1151 TTCCGGAGCA ATTTTACTTT TTTCTCTGCA G

FIG. 2

SEQUENCE LISTING

(1) General information 

(i) Applicant: 

(A) Name: Pasteur Sanofi Diagnostics 

(B) Street: 3 Boulevard Raymond Poincare 

(C) City: Marnes La Coquette 

(D) Country: France 

(E) Postal Code: 92430 

(ii) Title of invention: Nucleotide sequences for 
the detection of enterohaemorrhagic Escherichia coli 

(iii) Number of sequences: 27 



(2) Information for sequence ID No. 1:
(i) Sequence characteristics:
(A) Length: 1489 base pairs
(B) Type: Nucleic acid
(C) Strandedness: double
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 1:

   1 CTGCAGTCCG GAGATGAAAG CACCACTGTG TGTACCCCAT CAGCGTGGTC
  51 CCGCAGGCCA TGATTTTTGT CACAGACTCA ATGACTACCG GACGCACTGA
 101 ACCTTCCGGT TGTTTCTCCA GCCAGTTAAG CCAGCGGTTT CCCTGCTGAA
 151 AAATGTCGGC AAAACGGGGA AGCATCAGAA GGGCGGGGGA ACTCCGTCCG
 201 GCCAGTGAAC CGTGCCACAC TCCGGGCAGT ACATGCCGCC GGCGCTGATA
 251 CCGGCAAGAA TGGTCGCAAA CTCCCGCTCC GTGCAGCGGG CTATTTCAGG
 301 ATACCCTTCG TCATCAACAC GTACAAACCA GAAGACCAGC TTTTTGTTTC
 351 TGACATCCAC AAAGAAGGGA ATATTCAGGT CTGCGCAGCA CTCAACGGCA
 401 TCGTCAGTTG CGGCTTGGAA CCCCTTAGTA TTTTTTGTCT GTAGTATCTA
 451 TCCCAGCAAT AGGTATATCC TGTTGCATCA ATAAAGTTGA CTTTTGTATA
 501 CAACATGCGA ATTTCCCTTA ATCCGGAGCT ATTCGTATGA TAAAAAAAAC
 551 TCTTCCTGTT CTGATTCTTC TGGCGCTATC GGGGAGCTTT TCTACCGCTG
 601 TAGCCGCTGA TAAAAAAGAG ACTCAAAATT TCTACTATCC AGAAACACTG
 651 GATTTAACTC CTCTGAGATT ACACAGCCCT GAATCAAATC CCTGGGGGGC
 701 TGATTTTGAT TATGCCACCA GATTTCAACA GCTGGATATG GAGGCTCTGA
 751 AAAAAGATAT CAAAGATTTG CTGACAACTT CCCAGGATTG GTGCCCTGCG
 801 GATTATGGTC ATTATGGTCC TTTCTTTATT CGTATGGCTT GGCACGGTGC
 851 CGGAACATAC AGGACATATG ATGGCCGGGG AGGCGCCAGT GGTGGTCAGC
 901 AACGTTTTGA ACCGCTGAAC AGCTGGCCGG ATAACGTTAA TCTGGATAAA
 951 GCCCGTCGAT TGCTGTGGCC AGTCAAGAAA AAATACGGCT CCAGTATTTC
1001 CTGGGGAGAC CTGATGGTCC TGACTGGTAA TGTTGCCCTT GAATCCATGG
1051 GATTTAAAAC GCTGGGATTT GCTGGCGGAA GAGAAGATGA CTGGGAGTCG
1101 GACCTGGTAT ACTGGGGGCC TGACAACAAG CCTCTTGCAG ATAACCGGGA
1151 TAAAAACGGG AAACTTCAGA AACCTCTTGC CGCCACGCAG ATGGGACTTA
1201 TTTATGTCAA TCCTGAAGGC CCCGGTGGAA AACCAGATCC TCTGGCTTCC
1251 GCGAAAGATA TCAGGGAAGC TTTTTCACGT ATGGCCATGG ATGATGAGGA
1301 GACTGTGGCC CTGATCGCGG GAGGGCATAC ATTTGGTAAA GCACATGGTG
1351 CAGCGTCTCC TGAAAAATGT ATTGGCGCAG GGCCTGATGG TGCACCTGTG
1401 GAGGAGCAGG GACTGGGATG GAAAAATAAA TGTGGTACAG GAAACGGCAA
1451 ATATACCATC ACCAGTGGCC TGGAAGGAGC CTGGTCGAC


(2) Information for sequence ID No. 2:
(i) Sequence characteristics:
(A) Length: 1181 base pairs
(B) Type: Nucleic acid
(C) Strandedness: double
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 2:

   1 CTGCAGGAGA TGGAAAAAAA GCCAAAATAA AAAATTGCCC ATCCCAGCGC
  51 GCTCCAGCTG AAAGTAGGCC TGTTCTGTCC GGTATTTAAA TGCATTGACC
 101 GTCCCCGTAT TTAAACAATG TGATAAATTA CTCCGTTACC GGAAAACCGC
 151 TGAACAAAAT TCGGGCTGAA AAGAGGATCC GCCGTTATCT GTTGCATTTC
 201 CCCTTAGCCT GACTAGCCAC AGACACAATG ATCTGTGCCG TTCTGTTAAT
 251 ATCAAACCGG TACTCAATAT CTTCTCTGGC GCTGGCTGCC ATCATCCGGA
 301 AGCGTTCCGG TCGGGATAAA AAATCGCGCA GTGCGCCGGT CCATGCAGAC
 351 ACATCCCCCA CGGGTAACAG CGTCCCTGTC ACATTCTTCT GAATGACATC
 401 AGGGATCCCG CCCGTCTCAC TGGCGATAAC GGGCACGCCG GAGACTGACG
 451 CTTCAGCCAG TACCATACCA AACGCTTCAT TTTCCGAAGG CATGACCACC
 501 ACACTGGCAA TCCGGTAGAC CGGTAACGCT GGGAAAAGGG CACCTGCCAT
 551 TAACACATCT CCGCTCATTC CCAGGTGTTC TGTCTGCTGA CGCAGACGTG
 601 CTTCGTATTC TTCACGCCCG GCGCCCACCA CGAGCCAGCG AAATGATTTC
 651 CCTTCCATCT TCAGCTGATA CAATACACGC AGCATAAATT CATGTCCTTT
 701 TTCGGGACGT AGCATCCCCA CCTGAACGAT AAGCGGAACA TTGTCTGCTG
 751 ATGCAGCCCA GGCGTGGATA TGCAGGGGTA ACGGTCGCAT GGCTTCATTA
 801 TGCAATGCGG GCCAGTCGAA ACCCGGTGGA ATAACCGTTA CCGGTGTCCT
 851 GACACCTTCC GCCATCAGAT GCGCCATCAT GGGTGAGATA GGCACAACAA
 901 TGAAATCACA CAGATAATTC AGGGAAAACG TTCTGGTCTT ACGGGTGATG
 951 TAGGTTTTTT GTCTGACAAT AGTGAAGCGG TGACAGCATA TCAGACGGCT
1001 CAGTCCTGCT ATATTACTGT CATGGCCACT ATGGCAGATG ACCAGATCAG
1051 GTTTAAATTC CCCGATAATC CGTCGAAGTC TGAGGATGGA AGGAAGGTGA
1101 AGGCTGTTCC TGAAAGGAAT AAAAGTGACA TCATGCCCTC TTTTTCTGGC
1151 TTCCGGAGCA ATTTTACTTT TTTCTCTGCA G





(2) Information for sequence ID No. 3:
(i) Sequence characteristics:
(A) Length: 22 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 3:
CGGAGATGAAAGCACCACTGTG

(2) Information for sequence ID No. 4:
(i) Sequence characteristics:
(A) Length: 22 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: antisense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 4:
GGGCTGTGTAATCTCAGAGGAG

(2) Information for sequence ID No. 5:
(i) Sequence characteristics:
(A) Length: 25 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 5:
GTCCGGAGATGAAAGCACCACTGTG

(2) Information for sequence ID No. 6:
(i) Sequence characteristics:
(A) Length: 25 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: antisense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 6:
TCAGGGCTGTGTAATCTCAGAGGAG

(2) Information for sequence ID No. 7:
(i) Sequence characteristics:
(A) Length: 23 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 7:
GGCGCTGATACCGGCAAGAATGG

(2) Information for sequence ID No. 8:
(i) Sequence characteristics:
(A) Length: 23 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 8:
GGTCCCGCAGGCCATGATTTTTG

(2) Information for sequence ID No. 9:
(i) Sequence characteristics:
(A) Length: 24 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 9:
CCGGCAAGAATGGTCGCAAACTCC

(2) Information for sequence ID No. 10:
(i) Sequence characteristics:
(A) Length: 26 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: antisense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 10:
AAGGGGTTCCAAGCCGCAACTGACGA

(2) Information for sequence ID No. 11:
(i) Sequence characteristics:
(A) Length: 26 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: antisense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 11:
TAAGGGGTTCCAAGCCGCAACTGACG

(2) Information for sequence ID No. 12:
(i) Sequence characteristics:
(A) Length: 31 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 12:
CTCAACGGCATCGTCAGTTGCGGCTTGGAAC

(2) Information for sequence ID No. 13:
(i) Sequence characteristics:
(A) Length: 31 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 13:
AGCACTCAACGGCATCGTCAGTTGCGGCTTG

(2) Information for sequence ID No. 14:
(i) Sequence characteristics:
(A) Length: 31 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 14:
CTATTTCAGGATACCCTTCGTCATCAACACG

(2) Information for sequence ID No. 15:
(i) Sequence characteristics:
(A) Length: 31 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 15:
AATTTCCCTTAATCCGGAGCTATTCGTATGA

(2) Information for sequence ID No. 16:
(i) Sequence characteristics:
(A) Length: 20 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 16:
GAAGACCAGCTTTTTGTTTC

(2) Information for sequence ID No. 17:
(i) Sequence characteristics:
(A) Length: 20 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 17:
TGTCACAGACTCAATGACTA

(2) Information for sequence ID No. 18:
(i) Sequence characteristics:
(A) Length: 14 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 18:
GGCATCGTCAGTTG

(2) Information for sequence ID No. 19:
(i) Sequence characteristics:
(A) Length: 16 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 19:
CGGCATCGTCAGTTGC

(2) Information for sequence ID No. 20:
(i) Sequence characteristics:
(A) Length: 18 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 20:
ACGGCATCGTCAGTTGCG

(2) Information for sequence ID No. 21:
(i) Sequence characteristics:
(A) Length: 22 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 21:
CGACCTGAACGATAAGCGGAAC

(2) Information for sequence ID No. 22:
(i) Sequence characteristics:
(A) Length: 22 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: antisense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 22:
CACCTTCCTTGCATCCTGAGAG

(2) Information for sequence ID No. 23:
(i) Sequence characteristics:
(A) Length: 20 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 23:
ATCCCAGCGCGCTCCAGCTG

(2) Information for sequence ID No. 24:
(i) Sequence characteristics:
(A) Length: 22 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: antisense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 24:
CATGATGGCGCATCTGATG

(2) Information for sequence ID No. 25:
(i) Sequence characteristics:
(A) Length: 31 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 25:
TTCTGGTCTTACGGGTGATGTAGGTTTT

(2) Information for sequence ID No. 26:
(i) Sequence characteristics:
(A) Length: 31 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 26:
TAGTGAAGCGGTGACAGCATATCAGACGGCT

(2) Information for sequence ID No. 27:
(i) Sequence characteristics:
(A) Length: 21 base pairs
(B) Type: Nucleic acid
(C) Strandedness: single
(D) Topology: linear
(E) Strand: sense
(ii) Molecule type: DNA (genomic)
(iii) Sequence description: SEQ ID No. 27:
GTGAGATAGGCACAACAATGA





SEQUENCE LISTING 

<110> BIO- 

<120> Nucleotide sequences for detecting enterohaemorrhagic
Escherichia coli



<130> BET 00/0916 



<140> US 09/674 277 

<141> 1999-04-27 

<150> FR 98/05329 

<151> 1998-04-28 



<160> 8 



<170> PatentIn Ver. 2.1



<210> 1 
<211> 1489 
<212> DNA 

<213> Escherichia coli 



<400> 1 

ctgcagtccg gagatgaaag caccactgtg tgtaccccat cagcgtggtc ccgcaggcca 60
tgatttttgt cacagactca atgactaccg gacgcactga accttccggt tgtttctcca 120
gccagttaag ccagcggttt ccctgctgaa aaatgtcggc aaaacgggga agcatcagaa 180
gggcggggga actccgtccg gccagtgaac cgtgccacac tccgggcagt acatgccgcc 240
ggcgctgata ccggcaagaa tggtcgcaaa ctcccgctcc gtgcagcggg ctatttcagg 300
atacccttcg tcatcaacac gtacaaacca gaagaccagc tttttgtttc tgacatccac 360
aaagaaggga atattcaggt ctgcgcagca ctcaacggca tcgtcagttg cggcttggaa 420
ccccttagta ttttttgtct gtagtatcta tcccagcaat aggtatatcc tgttgcatca 480
ataaagttga cttttgtata caacatgcga atttccctta atccggagct attcgtatga 540
taaaaaaaac tcttcctgtt ctgattcttc tggcgctatc ggggagcttt tctaccgctg 600
tagccgctga taaaaaagag actcaaaatt tctactatcc agaaacactg gatttaactc 660
ctctgagatt acacagccct gaatcaaatc cctggggggc tgattttgat tatgccacca 720
gatttcaaca gctggatatg gaggctctga aaaaagatat caaagatttg ctgacaactt 780
gccaggattg gtgccctgcg gattatggtc attatggtcc tttctttatt cgtatggctt 840
ggcacggtgc cggaacatac aggacatatg atggccgggg aggcgccagt ggtggtcagc 900
aacgttttga accgctgaac agctggccgg ataacgttaa tctggataaa gcccgtcgat 960
tgctgtggcc agtcaagaaa aaatacggct ccagtatttc ctggggagac ctgatggtcc 1020
tgactggtaa tgttgccctt gaatccatgg gatttaaaac gctgggattt gctggcggaa 1080
gagaagatga ctgggagtcg gacctggtat actgggggcc tgacaacaag cctcttgcag 1140
ataaccggga taaaaacggg aaacttcaga aacctcttgc cgccacgcag atgggactta 1200
tttatgtcaa tcctgaaggc cccggtggaa aaccagatcc tctggcttcc gcgaaagata 1260
tcagggaagc tttttcacgt atggccatgg atgatgagga gactgtggcc ctgatcgcgg 1320
gagggcatac atttggtaaa gcacatggtg cagcgtctcc tgaaaaatgt attggcgcag 1380
ggcctgatgg tgcacctgtg gaggagcagg gactgggatg gaaaaataaa tgtggtacag 1440
gaaacggcaa atataccatc accagtggcc tggaaggagc ctggtcgac 1489
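
The listing prints each sequence in blocks of ten bases, with a running base count closing every row. As an illustration only, the following Python sketch shows how transcribed rows of this kind could be joined and checked against their running counts. The function name `parse_listing_rows` is hypothetical, and only the first two rows of SEQ ID No. 1 are used.

```python
def parse_listing_rows(rows):
    """Join listing rows into one sequence, checking each running count.

    Each row is expected to hold blocks of bases followed by the
    cumulative number of bases reached at the end of that row.
    """
    sequence = ""
    for row in rows:
        *blocks, count = row.split()
        sequence += "".join(blocks).lower()
        if len(sequence) != int(count):
            raise ValueError(
                f"row ending at {count} has running length {len(sequence)}"
            )
    return sequence


# First two rows of SEQ ID No. 1 as given in the listing.
rows = [
    "ctgcagtccg gagatgaaag caccactgtg tgtaccccat cagcgtggtc ccgcaggcca 60",
    "tgatttttgt cacagactca atgactaccg gacgcactga accttccggt tgtttctcca 120",
]
seq = parse_listing_rows(rows)
print(len(seq))  # 120
```

A row whose blocks were merged, split or dropped during transcription would raise a `ValueError` naming the affected row.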



<210> 2 
<211> 1181 
<212> DNA 

<213> Escherichia coli 



<400> 2 

ctgcaggaga tggaaaaaaa gccaaaataa aaaattgccc atcccagcgc gctccagctg 60
aaagtaggcc tgttctgtcc ggtatttaaa tgcattgacc gtccccgtat ttaaacaatg 120
tgataaatta ctccgttacc ggaaaaccgc tgaacaaaat tcgggctgaa aagaggatcc 180
gccgttatct gttgcatttc cccttagcct gactagccag agacacaatg atctgtgccg 240
ttctgttaat atcaaaccgg tactcaatat cttctctggc gctggctgcc atcatccgga 300
agcgttccgg tcgggataaa aaatcgcgca gtgcgccggt ccatgcagac acatccccca 360
cgggtaacag cgtccctgtc acattcttct gaatgacatc agggatcccg cccgtctcac 420
tggcgataac gggcacgccg gagactgacg cttcagccag taccatacca aacgcttcat 480
tttccgaagg catgaccacc acactggcaa tccggtagac cggtaacgct gggaaaaggg 540
cacctgccat taacacatct ccgctcattc ccaggtgttc tgtctgctga cgcagacgtg 600
cttcgtattc ttcacgcccg gcgcccacca cgagccagcg aaatgatttc ccttccatct 660
tcagctgata caatacacgc agcataaatt catgtccttt ttcgggacgt agcatcccca 720
cctgaacgat aagcggaaca ttgtctgctg atgcagccca ggcgtggata tgcaggggta 780
acggtcgcat ggcttcatta tgcaatgcgg gccagtcgaa acccggtgga ataaccgtta 840
ccggtgtcct gacaccttcc gccatcagat gcgccatcat gggtgagata ggcacaacaa 900
tgaaatcaca cagataattc agggaaaacg ttctggtctt acgggtgatg taggtttttt 960
gtctgacaat agtgaagcgg tgacagcata tcagacggct cagtcctgct atattactgt 1020
catggccact atggcagatg accagatcag gtttaaattc cccgataatc cgtcgaagtc 1080
tgaggatgga aggaaggtga aggctgttcc tgaaaggaat aaaagtgaca tcatgccctc 1140
tttttctggc ttccggagca attttacttt tttctctgca 1181



<210> 3 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 3 

cggagatgaa agcaccactg tg 



<210> 4 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 4 

gggctgtgta atctcagagg ag 



<210> 5 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 5 

gtccggagat gaaagcacca ctgtg 



<210> 6 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 6 

tcagggctgt gtaatctcag aggag 



<210> 7 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 7 

ggcgctgata ccggcaagaa tgg 



<210> 8 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 8 

ggtcccgcag gccatgattt ttg 



<210> 9 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 9 

ccggcaagaa tggtcgcaaa ctcc 



<210> 10 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 10 

aaggggttcc aagccgcaac tgacga 



<210> 11 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 11 

taaggggttc caagccgcaa ctgacg 



<210> 12 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 12 

ctcaacggca tcgtcagttg cggcttggaa c 



<210> 13 
<211> 31 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 13 

agcactcaac ggcatcgtca gttgcggctt g 



<210> 14 
<211> 31 

<212> DNA
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 14 

ctatttcagg atacccttcg tcatcaacac g 



<210> 15 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 15 

aatttccctt aatccggagc tattcgtatg a 



<210> 16 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 16 

gaagaccagc tttttgtttc 



<210> 17 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 17 

tgtcacagac tcaatgacta 



<210> 18
<211> 14
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 18

ggcatcgtca gttg 14



<210> 19
<211> 16
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 19

cggcatcgtc agttgc 16



<210> 20
<211> 18
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 20

acggcatcgt cagttgcg 18



<210> 21
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 21

ccacctgaac gataagcgga ac 22



<210> 22
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 22

caccttcctt ccatcctcag ac 22



<210> 23
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 23

atcccagcgc gctccagctg



<210> 24 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 24 

acccatgatg gcgcatctga tg 



<210> 25 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 25 

acgttctggt cttacgggtg atgtaggttt t 



<210> 26 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 26 

tagtgaagcg gtgacagcat atcagacggc t 



<210> 27 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 27 

gtgagatagg cacaacaatg a 



CLAIMS

1. Isolated nucleotide sequence comprising the 
nucleic sequence SEQ ID No. 1 or the nucleic sequence 
SEQ ID No. 2, their complementary sequences, the
fragments and derived sequences thereof, differing by 
mutation, insertion, deletion and/or substitution of 
one or more bases and hybridizing under high stringency 
conditions with the sequences SEQ ID No. 1 and 
SEQ ID No. 2, respectively. 

2. Isolated nucleotide sequence comprising the
sequence SEQ ID No. 1, the sequences complementary
thereto and the sequences derived therefrom, comprising
a nucleotide chain resulting from the stable
combination of at least a portion of the insertion
sequence IS91 and at least a portion of the sequence of
the katP gene.

3. Isolated nucleotide sequence according to Claim
2, comprising at least 8, advantageously 10, preferably
14 consecutive nucleotides of the chain of the sequence
SEQ ID No. 1, including the nucleotides from position
400 to 407.
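
By way of illustration only, the requirement that a sequence include positions 400 to 407 of SEQ ID No. 1 can be checked with a short Python sketch. It assumes the row of the sequence listing that covers positions 361 to 420; the helper name `region` is hypothetical, and the three oligonucleotides tested are SEQ ID No. 18, 19 and 20 as given in the listing.

```python
# Row of SEQ ID No. 1 covering positions 361-420 (1-based), from the listing.
ROW_START = 361
row = "aaagaagggaatattcaggtctgcgcagcactcaacggcatcgtcagttgcggcttggaa"


def region(start, end):
    """Return bases start..end (1-based, inclusive) taken from the row."""
    return row[start - ROW_START:end - ROW_START + 1]


junction = region(400, 407)

# SEQ ID No. 18, 19 and 20 as given in the sequence listing.
probes = {
    18: "ggcatcgtcagttg",
    19: "cggcatcgtcagttgc",
    20: "acggcatcgtcagttgcg",
}
spanning = {number: junction in seq for number, seq in probes.items()}
print(junction, spanning)
```

With the row as transcribed here, all three oligonucleotides contain the eight bases at positions 400 to 407.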

4. Isolated nucleotide sequence comprising at
least 8 consecutive nucleotides of the sequence
SEQ ID No. 1 or of the sequence SEQ ID No. 2, or of
sequences complementary thereto and derived therefrom,
as defined in Claim 1.

5. Isolated nucleotide sequence according to Claim 4,
selected from the following nucleic sequences:

SEQ ID No. 3: 5' - CGGAGATGAAAGCACCACTGTG - 3'
SEQ ID No. 4: 5' - GGGCTGTGTAATCTCAGAGGAG - 3'
SEQ ID No. 5: 5' - GTCCGGAGATGAAAGCACCACTGTG - 3'
SEQ ID No. 6: 5' - TCAGGGCTGTGTAATCTCAGAGGAG - 3'
SEQ ID No. 7: 5' - GGCGCTGATACCGGCAAGAATGG - 3'
SEQ ID No. 8: 5' - GGTCCCGCAGGCCATGATTTTTG - 3'
SEQ ID No. 9: 5' - CCGGCAAGAATGGTCGCAAACTCC - 3'
SEQ ID No. 10: 5' - AAGGGGTTCCAAGCCGCAACTGACGA - 3'
SEQ ID No. 11: 5' - TAAGGGGTTCCAAGCCGCAACTGACG - 3'
SEQ ID No. 12: 5' - CTCAACGGCATCGTCAGTTGCGGCTTGGAAC - 3'
SEQ ID No. 13: 5' - AGCACTCAACGGCATCGTCAGTTGCGGCTTG - 3'
SEQ ID No. 14: 5' - CTATTTCAGGATACCCTTCGTCATCAACACG - 3'
SEQ ID No. 15: 5' - AATTTCCCTTAATCCGGAGCTATTCGTATGA - 3'
SEQ ID No. 16: 5' - GAAGACCAGCTTTTTGTTTC - 3'
SEQ ID No. 17: 5' - TGTCACAGACTCAATGACTA - 3'
SEQ ID No. 18: 5' - GGCATCGTCAGTTG - 3'
SEQ ID No. 19: 5' - CGGCATCGTCAGTTGC - 3'
SEQ ID No. 20: 5' - ACGGCATCGTCAGTTGCG - 3'
SEQ ID No. 21: 5' - CCACCTGAACGATAAGCGGAAC - 3'
SEQ ID No. 22: 5' - CACCTTCCTTCCATCCTCAGAC - 3'
SEQ ID No. 23: 5' - ATCCCAGCGCGCTCCAGCTG - 3'
SEQ ID No. 24: 5' - ACCCATGATGGCGCATCTGATG - 3'
SEQ ID No. 25: 5' - ACGTTCTGGTCTTACGGGTGATGTAGGTTTT - 3'
SEQ ID No. 26: 5' - TAGTGAAGCGGTGACAGCATATCAGACGGCT - 3'
SEQ ID No. 27: 5' - GTGAGATAGGCACAACAATGA - 3'
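
The oligonucleotides listed above are written 5' to 3', and those described as antisense in the sequence listing correspond to the reverse complement of the strand shown for SEQ ID No. 1 or SEQ ID No. 2. As a hedged illustration, the Python sketch below locates SEQ ID No. 24 (antisense) on the row of SEQ ID No. 2 that covers positions 841 to 900. The helper names `reverse_complement` and `locate` are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence, in upper case."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def locate(primer, target, antisense=False):
    """Return the 0-based index of the primer site on the target, or -1.

    For an antisense primer the reverse complement is searched for,
    because the target is written as the sense strand.
    """
    query = reverse_complement(primer) if antisense else primer.upper()
    return target.upper().find(query)


# Row of SEQ ID No. 2 covering positions 841-900 (1-based), from the listing.
ROW_START = 841
row = "ccggtgtcctgacaccttccgccatcagatgcgccatcatgggtgagataggcacaacaa"

SEQ_24 = "ACCCATGATGGCGCATCTGATG"  # antisense according to the listing

index = locate(SEQ_24, row, antisense=True)
print(reverse_complement(SEQ_24))  # CATCAGATGCGCCATCATGGGT
print(ROW_START + index)           # 863
```

Searching for SEQ ID No. 24 without taking the reverse complement finds no site on this row, which is consistent with its antisense designation.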

6. Pairs of isolated nucleotide sequences
according to Claim 4 or 5, used as primers, selected
from the following pairs of sequences:

SEQ ID No. 3 and SEQ ID No. 4 
SEQ ID No. 5 and SEQ ID No. 6 
SEQ ID No. 6 and SEQ ID No. 7 
SEQ ID No. 6 and SEQ ID No. 8 
SEQ ID No. 6 and SEQ ID No. 9 
SEQ ID No. 21 and SEQ ID No. 22 
SEQ ID No. 23 and SEQ ID No. 24 



7. Isolated nucleotide sequence according to Claim
4 or 5, used as probe, selected from the following
sequences: SEQ ID No. 14, SEQ ID No. 25, SEQ ID No. 15,
SEQ ID No. 26, SEQ ID No. 18, and SEQ ID No. 27.

8. Isolated nucleotide sequence according to Claim
7, characterized in that it is labelled.

9. Isolated nucleotide sequence according to Claim
7, characterized in that it is immobilized on a
support.

10. Plasmids pDF3 and pDF4 deposited at the
Collection Nationale de Cultures de Microorganismes
respectively under the numbers I-1999 and I-2000, on 26
March 1998.

11. Host cell comprising a plasmid according to
Claim 10.

12. Method for the detection of E. coli O157:H7 or
EHECs in a sample, comprising the following steps:

(a) bringing the sample into contact with a
pair of oligonucleotide primers chosen from the
oligonucleotides defined in Claim 5, the nucleic acid
contained in the sample having been, where appropriate,
made accessible to the hybridization of the primers
with the nucleic acid of the target tested for,

(b) amplifying the nucleic sequence flanked by
the pair of primers chosen,

(c) verification of the presence of the
amplified product by the use of at least one probe
specific for the amplified product.
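
Steps (a) to (c) can be pictured with a minimal in silico sketch in Python. It assumes the part of SEQ ID No. 2 covering positions 661 to 1140 as transcribed in the sequence listing, the primers SEQ ID No. 21 and SEQ ID No. 22, and the probes SEQ ID No. 25 and SEQ ID No. 27. The function names are hypothetical, and exact string matching stands in for hybridization, which is a simplification.

```python
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence, in lower case."""
    return seq.lower().translate(COMPLEMENT)[::-1]


def amplicon(template, forward, reverse):
    """Return the stretch flanked by, and including, both primer sites.

    The forward primer is matched as written; the reverse primer is
    matched through its reverse complement. Returns None if either
    site is absent.
    """
    template = template.lower()
    start = template.find(forward.lower())
    if start < 0:
        return None
    rev_site = reverse_complement(reverse)
    hit = template.find(rev_site, start + 1)
    if hit < 0:
        return None
    return template[start:hit + len(rev_site)]


# Rows of SEQ ID No. 2 covering positions 661-1140 (1-based), from the listing.
FRAGMENT_START = 661
rows = [
    "tcagctgata caatacacgc agcataaatt catgtccttt ttcgggacgt agcatcccca",
    "cctgaacgat aagcggaaca ttgtctgctg atgcagccca ggcgtggata tgcaggggta",
    "acggtcgcat ggcttcatta tgcaatgcgg gccagtcgaa acccggtgga ataaccgtta",
    "ccggtgtcct gacaccttcc gccatcagat gcgccatcat gggtgagata ggcacaacaa",
    "tgaaatcaca cagataattc agggaaaacg ttctggtctt acgggtgatg taggtttttt",
    "gtctgacaat agtgaagcgg tgacagcata tcagacggct cagtcctgct atattactgt",
    "catggccact atggcagatg accagatcag gtttaaattc cccgataatc cgtcgaagtc",
    "tgaggatgga aggaaggtga aggctgttcc tgaaaggaat aaaagtgaca tcatgccctc",
]
fragment = "".join("".join(rows).split())

SEQ_21 = "ccacctgaacgataagcggaac"            # primer, sense
SEQ_22 = "caccttccttccatcctcagac"            # primer, antisense
SEQ_25 = "acgttctggtcttacgggtgatgtaggtttt"   # capture probe
SEQ_27 = "gtgagataggcacaacaatga"             # detection probe

product = amplicon(fragment, SEQ_21, SEQ_22)
first_position = FRAGMENT_START + fragment.find(product)
probes_inside = all(probe in product for probe in (SEQ_25, SEQ_27))
print(len(product), first_position, probes_inside)  # 382 718 True
```

On the fragment as transcribed, the two primers flank a stretch of 382 bases that also contains both probe sequences. The sketch says nothing about hybridization conditions or yield.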

13. Method according to Claim 12, according to
which step (c) comprises the following substeps:

(c1) denaturation of the amplified sequences by
a physical or chemical means,

(c2) bringing a solution containing the
denatured amplified fragments of step (c1) into contact
with, on the one hand, at least one capture probe, and
on the other hand, at least one detection probe,
optionally labelled, the capture and detection probes
having a sequence as defined in Claim 1, and capable of
hybridizing with the same strand of the amplified
fragments, the said bringing into contact being
performed for a period sufficient to allow the
hybridization reaction,

(c3) at least one washing in order to remove
the unreacted nucleic sequences,

(c4) visualization of the detection probes
hybridized with the amplified nucleic sequences.

14. Method according to Claim 12 or 13, in which
the capture probe is attached to the surface of a well
of a microtitration plate.

15. Method according to Claim 12 or 13, in which
the detection probe is labelled with peroxidase.

16. Method according to any one of Claims 13 to 15,
characterized in that the detection of the activity of
the peroxidase linked to the detection probe which has
reacted, is carried out by colorimetric reaction, in
the presence of a chromogenic substrate, such as
tetramethylbenzidine (TMB), using the following steps:

- addition of the chromogenic substrate, such
as a TMB solution, to the wells containing the reaction
mixture,

- incubation, in the dark, for a sufficient
period to allow the colour to develop,

- blocking of the reaction by addition of a
blocking solution,

- determination of the optical density at an
appropriate wavelength.

17. Method for the detection of E. coli O157:H7,
according to any one of Claims 12 to 16, using the
following oligonucleotides:

- the sequences SEQ ID No. 5 and SEQ ID No. 6,
as primers for the amplification,

- the sequence SEQ ID No. 15, as capture probe,

- the sequence SEQ ID No. 18, as detection
probe.

18. Method for the detection of the EHECs,
according to any one of Claims 12 to 16, using the
following oligonucleotides:

- the sequences SEQ ID No. 21 and
SEQ ID No. 22, as primers for the amplification,

- the sequence SEQ ID No. 25, as capture probe,

- the sequence SEQ ID No. 27, as detection
probe.

19. Kit for the detection of E. coli O157:H7 or
EHECs, comprising among the reagents:

- at least two oligonucleotides according to
Claim 5, used as a pair of primers,

- optionally at least one oligonucleotide probe
according to Claim 5, for the detection of the
amplified product.

Abstract

The invention concerns nucleic sequences of plasmid origin, present in bacteria of the group
enterohemorrhagic Escherichia coli (EHEC), the use of said sequences for searching for EHEC,
in particular those having genes coding for enterohemolysin and intimin virulence factors, and
more particularly for specific detection of the O157:H7 serotype. The invention also concerns a
method using said sequences and detection kits containing them.